Picture for Rongtao Xu

Rongtao Xu

Matching with Deliberation: Test-Time Evolutionary Hierarchical Multi-Agents for Zero-Shot Compositional Image Retrieval

Add code
May 21, 2026
Viaarxiv icon

RePO-VLA: Recovery-Driven Policy Optimization for Vision-Language-Action Models

Add code
May 10, 2026
Viaarxiv icon

Region Matters: Efficient and Reliable Region-Aware Visual Place Recognition

Add code
Apr 24, 2026
Viaarxiv icon

LaplacianFormer:Rethinking Linear Attention with Laplacian Kernel

Add code
Apr 22, 2026
Viaarxiv icon

AnySlot: Goal-Conditioned Vision-Language-Action Policies for Zero-Shot Slot-Level Placement

Add code
Apr 14, 2026
Viaarxiv icon

A1: A Fully Transparent Open-Source, Adaptive and Efficient Truncated Vision-Language-Action Model

Add code
Apr 07, 2026
Viaarxiv icon

ManipArena: Comprehensive Real-world Evaluation of Reasoning-Oriented Generalist Robot Manipulation

Add code
Mar 30, 2026
Viaarxiv icon

HMR-1: Hierarchical Massage Robot with Vision-Language-Model for Embodied Healthcare

Add code
Mar 09, 2026
Viaarxiv icon

\textsc{NaVIDA}: Vision-Language Navigation with Inverse Dynamics Augmentation

Add code
Jan 26, 2026
Viaarxiv icon

MoFu: Scale-Aware Modulation and Fourier Fusion for Multi-Subject Video Generation

Add code
Dec 26, 2025
Viaarxiv icon